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ABSTRACT 

Time-varying  short-term  spectral  estimates  have  been  success¬ 
fully  applied  in  many  classification  tasks.  However,  they  are  still 
insufficient  for  many  non-stationary  signals  where  time-varying 
information  is  useful.  In  this  paper,  we  propose  to  improve  the 
deficiencies  of  current  short-tenn  feature  analysis  by  adding 
information  to  describe  the  time-varying  behavior  of  the  signals. 
Our  proposed  method  which  is  motivated  by  the  human  auditory 
system  can  be  applied  to  several  non-stationary  signal  types. 
Real  world  communication  signals  were  used  for  experimental 
verification.  These  experimental  results,  assessed  with  a  conven¬ 
tional  probabilistic  classifier,  showed  significant  improvement 
when  the  new  features  were  added  to  short-tenn  spectral  esti¬ 
mates. 

1.  INTRODUCTION 

In  automatic  classification,  the  goal  is  to  have  a  machine 
characterize  events  and  make  an  appropriate  decision  about  the 
class  of  these  events.  Short-term  estimates,  for  example  Fourier 
or  autocorrelation  coefficients,  have  been  widely  used  for  many 
signal  types  (e.g.  [1,  2]).  Basically,  the  signal  is  blocked  into 
successive  frames  using  a  small  data  analysis  window.  This 
blocking  assumes  stationarity  of  the  signal  within  each  frame. 
The  windowed  signal  is  then  transfonned  into  a  new  representa¬ 
tion  hopefully  giving  good  discrimination  and/or  energy  compac¬ 
tion.  The  length  of  a  short-term  analysis  window  can  be  different 
depending  on  the  signal  type.  For  example  in  tool-wear  classifi¬ 
cation  [3],  stationarity  can  be  assumed  within  a  quarter  revolu¬ 
tion  of  a  tool.  Since  a  working  tool  has  different  sizes  and  rota¬ 
tion  speeds,  a  quarter  revolution  time  window  results  in  different 
time  durations,  20-40  ms.  In  speech  recognition  [4],  the  typical 
short-term  window  used  for  speech  analysis  is  about  20-30  ms. 
In  music  classification,  the  length  of  a  short-tenn  window  is 
influenced  by  audio  coding  where  typically  two  processing  win¬ 
dows  are  typically  used  [5],  With  a  sampling  rate  of  48  KHz, 
windows  of  size  256  samples,  or  about  5  ms,  and  2048  samples, 
or  about  43  ms,  are  commonly  applied.  Unless  specified  other¬ 
wise,  we  will  refer  a  data  window  of  length  less  than  50  ms  as  a 
short-term  window. 

In  this  paper,  we  model  non-stationary  signals,  such  as 
speech,  music,  or  communication  signals,  as  the  product  of  a 
narrow  bandwidth  lowpass  process  m(t)  modulating  a  higher 
bandwidth  carrier  c(t) ,  as  shown  in  Equation  (1).  To  effectively 
use  this  model,  m(t)  is  assumed  to  be  nonnegative  and  its 
bandwidth  does  not  overlap  with  c(t) . 

x(t)  =  m(t)c(t)  (1) 

The  above  model  has  been  applied  to  encode  x{t)  in  speech  [6] 
and  audio  [7],  Motivated  from  the  success  in  coding,  this  model 
can  also  be  useful  for  non-stationary  signal  classification.  An 
important  question  can  be  raised  whether  more  traditional  feature 
extraction  such  as  short-term  spectral  analysis  is  adequate  for 
extracting  the  pertinent  information  of  this  model.  Since  m(t)  is 


a  slowly  varying  signal,  using  too  short  of  an  analysis  window 
can  be  insufficient  to  model  m(t)  . 

Understanding  the  human  recognition  system  and  integrat¬ 
ing  the  relevant  aspects  significantly  contributes  to  the  under¬ 
standing  of  this  form  of  feature  extraction.  For  example  in  [4],  to 
estimate  energy  at  the  most  sensitive  modulation  frequency  of 
human  audition,  about  4  Hz,  an  analysis  window  of  at  least  250 
ms  is  needed.  A  related  concept  on  a  perceptual  duration  of  audi¬ 
tory  system  is  a  pre-perceptual  auditory  image  [8],  This  concept 
refers  to  the  process  where  an  auditory  input  produces  an  audi¬ 
tory  image  containing  information  about  a  stimulus.  Because  of 
the  continuous  change  of  the  auditory  input,  a  pre-perceptual 
auditory  store  is  used  to  hold  information  of  the  stimulus  and  can 
be  utilized  later.  Massaro  [8]  proposed  the  estimation  of  this 
perceptual  unit  using  a  backward  masking  experiment.  Results 
suggested  that  pre-perceptual  auditory  storage  and  processing 
was  over  200  ms  which  is  again  longer  than  a  short-tenn  analysis 
window.  Finally,  the  sensitivity  of  short-term  features  to  noise 
[9]  and  unseen  testing  data  [3]  is  another  deficiency. 

To  improve  the  deficiencies  of  short-term  feature  analysis, 
we  propose  long-tenn  feature  analysis  called  joint  frequency 
analysis.  Joint  frequency  analysis  not  only  contains  short-term 
information  about  the  signal,  but  also  contains  long-tenn  infor¬ 
mation  representing  patterns  of  time  variation.  We  will  show  that 
our  proposed  feature  analysis  is  complimentary  to  more  tradi¬ 
tional  short-tenn  feature  analysis.  Communication  signal  classi¬ 
fication  is  used  for  experimental  verification.  Related  work  [10] 
using  joint  frequency  analysis  in  a  communication  signal  inter¬ 
ception  application  used  a  method  requiring  a  priori  infonnation 
such  as  a  symbol  rate.  To  avoid  the  assumption  of  prior  infonna¬ 
tion,  we  use  time-frequency  theory  integrated  with  psychoacous¬ 
tic  results  on  modulation  frequency  perception  to  provide  a 
foundation  and  subsequent  joint  frequency  representation  for  a 
classification  system. 

2.  JOINT  ACOUSTIC  AND  MODULATION 
FREQUENCY  ANALYSIS 

2.1  Theory 

One  possible  joint  frequency  representation,  P(p,a> )  ,  is  a  trans- 
fonn  in  time  of  a  demodulated  short-time  spectral  estimate.  For 
the  purpose  of  this  paper,  a>  and  p  are  “acoustic  frequency” 
and  “modulation  frequency,”  respectively.  A  spectrogram,  or 
other  joint  time-frequency  representations  [11],  can  be  used  as 
the  starting  point  of  this  analysis.  In  this  paper,  we  first  use  a 
spectrogram  with  an  appropriately  chosen  window  length  to 
estimate  a  joint  time-frequency  representation  of  the  sig¬ 
nal,  PxF(t,co)  .  Second,  another  transform  (e.g.  Fourier)  is  applied 
along  the  time  dimension  of  the  spectrogram  to  estimate 
PxSF(p,co)  .  Another  way  of  viewing  PxF{p,co) ,  as  shown  in 
Equation  (2),  is  the  convolution  in  co  and  multiplication  in  p  of 
the  correlation  function  of  a  Fourier  transform  of  the  signal  x(t ) 
and  the  underlying  data  analysis  window  h(t)  : 
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Pf{r],a)  =  \^H\a>-^)H(e>  +  ^.^X\w-^)X(a>  +  ^)  j.  (2) 

To  illustrate  the  behavior  of  Pxp(rj,a>)  ,  an  AM  signal  is  used. 
Equation  (3)  describes  a  spectrogram  of  the  AM  signal,  where 
com  and  coc  are  the  modulation  and  carrier  frequencies,  respec¬ 
tively. 

PxSP(t,a)  =  (l  +  cosmj)  d((o±  coc)  (3) 

Applying  a  Fourier  transform  along  the  time  dimension  of 

If  if, a)  : 

07  >®)  =  \jT{J’ai)e,'"dt  (4) 

=  S(tj)  +  S(rj  ±  a>m)  +  —S(rj  ±  2com)^5(co  ±  coc). 

Pxp(rj,co)  results  in  the  compaction  of  non-zero  values  occur¬ 
ring  at  low  rj ,  \rj\  <  2 com  ,  which  can  be  advantageous  for  coding 
and  classification.  When  using  a  joint  frequency  representation 
as  features  in  non-stationary  signal  classification,  only  positive 
and  low  modulation  frequencies  are  needed  for  estimating  the 
low-dimensional  features  because  of  the  symmetric  property  in 
frequency  and  small  non-zero  support  region  of  Pxr(r/,co)  .  The 
modulation  frequency  range  of  interest  in  Pxp(tj,o>)  can  be  de¬ 
termined  by  the  assumed  highest  modulation  frequency  in  the 
signal  or  the  bandwidth  of  the  chosen  spectrogram  window. 

2.2  Interpretation 

In  Equation  (2),  PxSF(r/,a>)  values  along  rj  =  0  are  the  convolu¬ 
tion  of  the  spectra  of  the  spectrogram  window  and  the  signal. 

Pf(0,co)  =  ]tf(ffl)f  *jx(<y)|z  (5) 

The  values  lying  along  r/  =  0  are  an  averaged  short-term  spec¬ 
tral  estimate  of  the  signal.  The  length  of  the  spectrogram  window 
and  the  amount  of  overlap  between  data  analysis  windows  de¬ 
termine  the  trade-offs  between  the  bias  and  variance  of  the  short¬ 
term  spectral  estimate.  Pfp(rj,a>)  at  rj  =  0  is  an  estimate  of  the 
stationary  information  while  the  PxF(r/,a>)  at  r/^0  is  an  esti¬ 
mate  of  the  non-stationary  information  about  the  signal.  This 
non-stationary  infonnation  can  represent  various  quantities  (e.g. 
a  symbol  rate  of  a  communication  signal). 

In  this  paper,  joint  frequency  analysis  is  applied  to  the 
automatic  classification  of  unknown  communication  signals.  For 
digital  communication  signals  such  as  frequency  shift  keying 
(FSK)  or  phase  shift  keying  (PSK),  the  message  is  encoded  as 
the  change  of  frequency  or  phase  of  the  carrier,  respectively.  For 
the  spectrogram  of  FSK  as  shown  in  Figure  la,  the  transmitted 
signal  is  sent  by  switching  between  two  frequencies,  1200  Hz 
and  1500  Hz.  Since  filterbank  analysis  is  effected  via  a  spectro¬ 
gram,  the  change  of  frequency  is  transformed  into  a  change  of 
magnitude  in  each  subband  of  the  joint  time-frequency 
representation.  The  non-zero  terms  in  Pf  (t},aj)  as  shown  in 
Figure  lb  reveal  how  fast  the  instantaneous  frequency  of  this 
signal  pass  through  the  subband  filters.  The  non-zero  tenns 
occurring  at  harmonics  of  approximately  30  Hz  in  the  r/ 
dimension  of  Pxsp(r/,co)  reflect  the  symbol  rate  of  this  FSK 
signal.  For  the  PSK  signal,  the  phase  change  of  the  carrier  also 
contributes  to  the  change  of  the  instantaneous  amplitude.  The 
spectrogram  of  PSK  as  illustrated  in  Figure  2a  shows  the  signal 
having  high  bandwidth  in  (O  and  random  behavior  in  time. 
Pxp(r/,a>)  of  this  PSK  signal,  as  shown  in  Figure  2b,  exhibits  a 
more  compact  energy  representation.  There  are  non-zero  terms 


energy  representation.  There  are  non-zero  terms  occurring  at 
harmonics  of  approximately  15  Hz  in  the  rj  dimension.  For  a 
multilevel  modulation  signal  such  as  multilevel  FSK  (MFSK), 
the  non-zero  terms  will  appear  in  more  acoustic  frequency  sub¬ 
bands  than  an  FSK  signal. 

As  demonstrated  with  real-world  signals,  joint  frequency 
analysis  has  the  potential  to  extract  time- varying  infonnation  via 
the  non-zero  terms  in  the  representation.  These  non-zero  terms 
are  possibly  useful  for  discriminating  signal  types,  thus  they 
should  be  considered  as  useful  features.  However,  using 
Pxp(rj,co)  directly  for  classification  has  an  important  disadvan¬ 
tage.  Pxp(r/,a>) ,  as  well  as  other  two-dimensional  analysis,  is  a 
long-term  analysis,  therefore  it  provides  an  extremely  large  di¬ 
mension  compared  to  traditional  short-term  spectral  estimate. 
Even  though  we  can  reduce  the  feature  dimension  due  to  the 
symmetry  in  frequency  and  small  non-zero  support  region  of 
Pxp(rj,co)  ,  the  resulting  dimensions  are  still  too  large  for  typical 
classifiers.  Past  research  has  addressed  the  method  of  reducing 
feature  dimension  of  a  two  dimensional  representation  in  various 
ways.  For  example,  many  methods  view  a  two-dimensional  sig¬ 
nal  representation  as  an  image.  The  non-zero  terms  lying  in  the 
representation  then  can  be  viewed  as  the  lines  or  objects  and  the 
small  set  of  descriptors  being  invariant  to  translation,  rotation,  or 
scaling  can  be  extracted.  Since  we  are  interested  in  tasks  where 
human  auditory  signal  classification  is  largely  successful,  inte¬ 
grating  psychoacoustic  results  into  the  analysis  can  possibly 
provide  added  advantages  in  feature  design  and  selection. 

2.3  Modulation  scale 

Using  Fourier  analysis  for  the  modulation  frequency  transform  in 
the  above  analysis  results  in  a  uniform  frequency  bandwidth  in 
modulation  frequency;  however  this  approach  for  modulation 
decomposition  can  be  inefficient  for  auditory  classification  due 
to  the  resulting  high  dimensionality.  Furthermore,  the  uniform 
bandwidth  in  modulation  frequency  does  not  mimic  the  human 
auditory  system.  Recent  psychoacoustic  results  [12]  suggest  that 
a  log  frequency  scale,  with  a  constant-0  over  the  entire  fre¬ 
quency  range,  best  mimics  human  perception  of  modulation  fre¬ 
quency.  Our  approach  uses  a  continuous  wavelet  transform 
(CWT)  to  efficiently  approximate  this  constant-0  effect.  Our 
method,  called  modulation  scale  analysis,  starts  with  a  standard 
spectrogram  of  x(t)  : 

1  i  1 2 

PxF(t,co)  =  — \{x(u)h’(u-t)e~J audu\  .  (6) 

2n  ' 

For  discrete  scales  $  ,  the  wavelet  filter  is  applied  along 
each  temporal  row  of  the  spectrogram  output: 

P?(s,£,a)  =  -\Ff  .  (7) 

The  energy  across  the  wavelet  translation  axis  £  is  integrated: 

Pf(s,co)  =  J|pf(^,®)|2^.  (8) 

The  above  equation  yields  a  joint  frequency  representation  with 
non-uniform  resolution  in  the  modulation  frequency  dimension, 
as  indexed  by  the  discrete  scale  5  .  Our  past  work  [9]  showed  the 
advantage  of  using  wavelet  over  Fourier  bases  for  discriminating 
two  distinct  modulation  frequencies  when  the  dimension  of  their 
representations  is  the  same. 


2.4  Time  and  frequency  translation 

A  practical  classification  system  needs  to  be  robust  to  changes  in 
the  signal.  In  this  paper,  we  address  the  robustness  of  our  analy¬ 
sis  to  time  and  frequency  translation.  If  the  signal  is  shifted  in 
time,  t0 ,  and  in  acoustic  frequency,  coa ,  i.e.  y(t)  =  eJm°'x(t  - 10 ) , 
then  the  joint  frequency  representation  of  the  shifted  signal  can 
be  expressed  as 

PyP(V,a>)  =  e  im,'P^P(r/,(o-(j0) .  (9) 

The  effect  of  a  time  shift  results  in  a  phase  shift  in  the  r/  dimen¬ 
sion  of  Pfp(tj,co) .  By  using  the  absolute  value  of  Ptsp(rj,a)  ,  the 
estimate  is  invariant  to  the  time  shift.  For  the  modulation  scale 
representation  PlSF(s,a>)  ,  if  the  time  shift  is  relatively  small 
compared  to  the  integration  time  in  Equation  (8),  then  the  esti¬ 
mate  is  approximately  invariant  to  the  effect: 

=  Pxsp(s,co-ca0).  (10) 

An  acoustic  frequency  shift  causes  a  shift  in  the  co  dimension 
for  both  Pvsp(tj,co)  and  PtSP(s,a))  .  We  will  discuss  the  approach 
for  estimating  features  which  are  insensitive  to  this  frequency 
shift  in  Section  3.3. 


3.3  Feature  Extraction 

Each  file  was  resampled  to  11025  Hz.  After  removing  silences, 
the  resampled  audio  was  then  windowed  into  3-second  blocks 
which  overlapped  by  2.75  seconds.  A  spectrogram  was  computed 
for  each  block  using  a  Hanning  window  of  length  128  samples 
and  a  window  shift  of  21  samples  thereby  reducing  the  subband 
sampling  rate  to  about  512  Hz  before  the  modulation  transform. 
Biorthogonal  wavelet  filters,  with  8  different  dyadic  scales,  were 
used  to  produce  one  modulation  frequency  vector  for  each  acous¬ 
tic  subband.  The  output  features  included  modulation  scale 
[s,  A:] ,  and  spectral  estimate  P  [k]  features. 

As  previously  discussed,  the  nature  of  the  initial  demodula¬ 
tion  process  may  induce  an  acoustic  frequency  shift  in  Pmod  and 
Pspec .  Due  to  this  frequency  translation,  we  cannot  directly  apply 
Pmod  and  Pspec  to  typical  classifiers.  To  remove  the  frequency 
sensitivity  in  Pmod  ,  a  singular  value  decomposition  (SVD)  was 
considered.  Using  the  SVD,  we  can  estimate  the  acoustic  fre¬ 
quency  vector,  P“ ,  and  modulation  scale  vector,  Pm ,  given 
Pmod  ,  the  feature  matrix  with  rankr  ,  by  Equation  (11)  (where  a 
is  a  nonnegative  weight). 

PmJ.sM  =  (11) 


3.  EXPERIMENTS 

3.1  Signal  Interception 

There  are  many  modulation  types  used  to  transmit  digital  mes¬ 
sages  over  analog  channels.  In  many  applications  such  as  inter¬ 
ception  of  battlefield  communications,  modulation  type  is  un¬ 
known,  and  identification  of  the  type  is  a  critical  first  step  in 
monitoring  of  the  communication  channel.  Current  systems  rely 
on  the  accuracy  of  a  human  listener.  An  operator  manually  de¬ 
modulates  an  intercepted  signal  to  the  audible  frequency  range  at 
which  point  an  expert  listener  then  identifies  the  modulation 
type.  Past  research  into  automatic  identification  of  modulation 
type  has  used  a  combination  of  short-term  spectral  features,  i.e. 
[2],  A  weakness  of  features  of  this  type  is  that  they  are  sensitive 
to  frequency  translations  which  may  be  induced  during  the  initial 
demodulation  of  the  signal  into  the  audible  range  by  the  human 
operator.  Although  it  is  possible  to  align  the  center  frequencies 
of  signals  to  eliminate  this  drawback,  doing  so  requires  a  rather 
exact  estimate  of  the  carrier  frequency  [13].  We  will  show  that 
our  features  derived  from  joint  frequency  analysis  not  only  pro¬ 
vide  high  discrimination  but  also  low  sensitivity  to  frequency 
shift. 

3.2  Data  Collection 

The  real  communication  signals  used  in  the  experiments  were 
collected  from  http://rover.vistecprivat.de/~signals.  The  audio 
data  was  labeled  by  an  expert  listener  afterward.  There  were  a 
total  216  files  including  four  different  modulation  classes,  FSK, 
MFSK,  PSK  (phase  shift  keying  and  multilevel  PSK),  and 
MCVFT  (multichannel  FSK  and  PSK).  Each  file  had  different 
length  and  sampling  rate.  The  number  of  files  for  each  signal 
class  is  shown  in  Table  1 . 

Table  1:  The  number  of  communication  signal  files  used  in 
the  experiments. 


FSK  MFSK  PSK  MCVFT 
Number  of  files  77  41  72  26 


Using  Equation  (10),  the  modulation  scale  features  of  the  shifted 
signal  can  be  approximated  as  Pmod[s,k  -k0\  where  k0  is 
amount  of  frequency  translation.  Due  to  the  sparseness  of  the 
joint  frequency  representation,  Pmod[s ,k  -  k^]  could  be  viewed  as 
a  vertical  shift  of  the  non-zero  support  region  of  Figure  lb  and 
Figure  2b  while  the  structure  in  the  modulation  frequency  di¬ 
mension,  or  horizontal  axis  in  the  plots,  remains  the  same.  This  is 
equivalent  to  a  row  pennutation  of  the  Pmod  matrix.  It  can  be 
shown  that  a  row-permutation  in  this  matrix  results  in  a  row  per¬ 
mutation  of  the  left  matrix  in  the  SVD  which  implies  that  the 
frequency  shift  affects  only  P“ .  Since  P"'  and  a  were  insensi¬ 
tive  to  acoustic  frequency  shift,  they  were  chosen  for  our  fea¬ 
tures.  Because  Pmod  can  be  mostly  represented  using  only  one 
basis  vector,  we  derived  our  new  8  dimensional  modulation  fea¬ 
tures  as 


modulation  features 


(12) 


For  the  Pspec  vector,  four  scalar  features  which  were  also  insensi¬ 
tive  to  frequency  shift  were  extracted  including  entropy  and 
bandwidth  of  Pspec ,  and  the  mean  and  standard  deviation  of  the 
demodulated  spectrum  estimate.  The  normalized  central  second 
moment  of  the  analytic  signal  [2]  was  also  used  for  our  features. 
We  refer  to  these  five  features  collectively  as  "spectral  estimate" 
features.  To  reduce  the  dynamic  range  of  the  estimation,  all  fea¬ 
tures  were  normalized  by  the  standard  deviation  estimated  from 
all  signal  classes  before  classification. 

3.4  Results 

A  probabilistic  classifier,  Gaussian  mixture  model  (GMM),  was 
used  for  our  experiments.  Due  to  the  limited  number  of  examples 
in  each  signal  class,  the  method  of  withholding  each  example  in 
turn  for  testing  while  training  on  all  the  rest  ("leave-one-out" 
approach),  was  employed  to  evaluate  the  performance  for  the 
classifier.  For  a  given  multi-frame  test  file,  each  frame  was  as¬ 
signed  to  one  of  the  four  modulation  classes  having  the  highest 
likelihood  score.  The  modulation  class  winning  the  largest  num¬ 
ber  of  frame  assignments  was  chosen  as  the  class  for  that  exam- 
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pie.  Table  2  shows  the  accuracy  results  with  different  feature 
types  and  number  of  mixtures.  The  results  show  the  improved 
perfonnance  when  the  number  of  mixtures  is  increased.  Using 
short-term  spectral  estimate  features  exclusively  provides  per¬ 
fonnance  comparable  to  using  only  modulation  features.  How¬ 
ever,  adding  modulation  features  to  spectral  estimate  features 
yielded  markedly  improved  performance  for  all  numbers  of  mix¬ 
tures.  The  improved  perfonnance  of  adding  modulation  features 
also  was  seen  for  a  ^-nearest  neighbor  classifier. 


Table  2:  Percentage  accuracy  results  using  Gaussian  mixture 
models 


Features 

Number  of  mixtures 

2 

4 

6 

8 

10 

modulation 

51.9% 

58.3% 

62.0% 

61.6% 

65.7% 

spectral 

61.6% 

60.7% 

62.5% 

66.2% 

66.2% 

combined 

66.2% 

69.9% 

71.8% 

78.2% 

75.5% 

4.  CONCLUSIONS 

This  paper  discussed  joint  frequency  analysis  for  non-stationary 
signal  classification.  Our  joint  frequency  representation  has  pro¬ 
vided  not  only  short-tenn  information  but  also  long-term  infor¬ 
mation  about  the  signal.  Since  it  was  designed  using  time- 
frequency  theory  and  psychoacoustic  results,  the  resulting  ap¬ 
proach  has  potential  for  a  wide  range  of  non-stationary  signal 
types.  Communication  signal  type  classification  was  used  for 
experimental  verification.  The  experimental  results  using  the 
probabilistic  classifier  showed  that  adding  modulation  features  to 
spectral  estimate  features  significantly  improved  the  overall  sys¬ 
tem  performance.  For  the  best  performance,  the  relative  im¬ 
provement  was  18%  for  a  GMM  classifier  with  8  mixtures. 
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Figure  1:  A  spectrogram  representation  of  FSK  signal 
shows  the  change  of  two  frequencies  in  (a)  where  the  joint 
frequency  representation  of  this  signal  exhibits  a  more 
compact  representation  in  (b).  More  details  are  in  Section 
2.2. 


Figure  2:  A  spectrogram  representation  of  PSK  signal 
shows  varying  content  in  a  wide  range  of  frequency  and 
time  in  (a)  where  the  joint  frequency  representation  of  this 
signal  exhibits  a  more  compact  representation  in  (b).  More 
details  are  in  Section  2.2. 


